Median raphe serotonergic neurons projecting to the interpeduncular nucleus control preference and aversion

Appropriate processing of reward and aversive information is essential for survival. Although a critical role of serotonergic neurons in the dorsal raphe nucleus (DRN) in reward processing has been shown, the lack of rewarding effects with selective serotonin reuptake inhibitors (SSRIs) implies the presence of a discrete serotonergic system playing an opposite role to the DRN in the processing of reward and aversive stimuli. Here, we demonstrated that serotonergic neurons in the median raphe nucleus (MRN) of mice process reward and aversive information in opposite directions to DRN serotonergic neurons. We further identified MRN serotonergic neurons, including those projecting to the interpeduncular nucleus (5-HTMRN→IPN), as a key mediator of reward and aversive stimuli. Moreover, 5-HT receptors, including 5-HT2A receptors in the interpeduncular nucleus, are involved in the aversive properties of MRN serotonergic neural activity. Our findings revealed an essential function of MRN serotonergic neurons, including 5-HTMRN→IPN, in the processing of reward and aversive stimuli.

might play an opposite role to the DRN in the processing of reward and aversive stimuli.
Some research has suggested a critical role of the MRN in the processing of reward and aversive stimuli. Although not highly selective to serotonin neurons, pharmacological inactivation of the MRN induced conditioned place preference and ethanol reinstatement 29,30 , while electrical stimulation of the MRN suppresses lever pressing associated with reward 31 . More recently, cell-type selective recordings and manipulation showed that aversive stimuli increased the activity of vesicular glutamate transporter 2 (vGluT2)-expressing MRN glutamatergic neurons and that activation of these neurons induced place avoidance, although the role of serotonergic neurons in the MRN, which has a distinct projection pattern to vGluT2-expressing MRN glutamatergic neurons, has yet to be determined 32 . To clarify the precise role of MRN serotonin neurons in the processing of reward and aversive stimuli, we selectively recorded changes in serotonergic activity in the MRN and projection areas in response to rewarding and aversive stimuli. Next, we manipulated these serotonergic activities to determine whether their inhibition and activation were sufficient for mimicking behaviors induced by rewarding and aversive stimuli, respectively. Our findings demonstrate the opposite role of the MRN to the DRN in regulating the balance between preference and aversion, and further elucidate the serotonergic pathway and related 5-HT receptor subtypes.

Appetitive reward stimuli inhibit MRN serotonergic neurons, and inhibition of MRN serotonergic neurons is rewarding
To measure changes in neuronal activity of MRN serotonergic neurons, an adeno-associated virus (AAV) bearing GCaMP6s 33 or Venus 34 control under the control of the mouse tryptophan hydroxylase 2 (Tph2) promoter was injected into the MRN 15,35 . Histological analyses of Venus-expressing animals revealed the high specificity and coverage of the promoter (specificity: 91.4 ± 2.5%; coverage: 95.4 ± 1.4%; n = 5; Fig. 1a-d). Then, we recorded MRN serotonergic neuron activity before and after sucrose consumption through fiber photometry in freely moving mice ( Fig. 1e-i, Supplementary Fig. 1a-d, g). Individual analysis revealed that the GCaMP fluorescence started to decrease just after sucrose licking and that the fluorescence remained low throughout the bouts. These phase-dependent fluorescence changes were observed over almost every trial (Fig. 1g, h). The mean GCaMP fluorescence started to decrease after sucrose licking in the MRN of all mice (Fig. 1i). The average signal peak (ΔF/F 0 ) for GCaMP mice and Venus control mice was −2.51 ± 0.78% and −0.31 ± 0.45%, respectively (mean ± s.e.m., n = 7 (GCaMP) and n = 6 (Venus) mice; t 11 = 2.317, *P = 0.0408; Supplementary Fig. 1e). To determine the relationship between the objective value of rewards and changes in serotonergic activity in the MRN, we compared GCaMP fluorescence changes after consumption of 10% and 0.5% sucrose solutions. We found that the size of fluorescence changes after the consumption of 0.5% sucrose was significantly smaller than that after the consumption of 10% sucrose (Fig. 1j). When we analyzed the GCaMP fluorescence changes in the same baseline setting as the previous report on DRN serotonergic neurons 16 , we found a significant increase in GCaMP fluorescence from −1 to −0.5 s as well as a decrease after sucrose licking ( Supplementary Fig. 1i-l). As the tested mice should stop locomotion to lick sucrose solution from the spout, licking onset was time-locked to the termination of locomotion. To determine the extent to which changes in GCaMP fluorescence were affected by the termination of locomotion, we analyzed GCaMP fluorescence in the MRN before and after the termination of locomotion and compared the extent of the changes to that of sucrose-induced changes. We found a small but significant GCaMP fluorescence decrease from 0.3 to 1.2 s after the termination of locomotion ( Supplementary  Fig. 1m), whereas the size of the signal changes (mean ΔF/F 0 ) after the termination of locomotion was much smaller than that after sucrose licking (t 6 = 2.935, *P = 0.0261; Supplementary Fig. 1n).
To assess the causal relationship between decreases in MRN activity and appetitive characteristics, we tested the effect of optogenetic inhibition of MRN serotonin neurons in the conditioned place preference (CPP) test 15,36 . MRN serotonin neurons were selectively transduced with archaerhodopsin (eArchT), a light-activated proton pump, for optogenetic inhibition 37 (Fig. 1k, l and Supplementary  Fig. 1h). Ex vivo electrophysiology experiment revealed that activity of eArchT-expressing cells was significantly attenuated by green light illumination and was reversed to the basal level after light illumination (before: 2.0 ± 0.33 Hz, light: 0.038 ± 0.0028 Hz, after: 1.5 ± 0.33 Hz, 30 sweeps in 6 cells from 3 mice; ***P = 0.0005 by one-way repeated measures ANOVA, **P = 0.0062 (before vs light), P = 0.2209 (before vs after), *P = 0.0221 (light vs after) by paired t-test, Supplementary  Fig. 1o, p). In the pretest session, the mice were allowed to freely explore two chambers with different wall colors and textures to facilitate discrimination. The time spent in each chamber was recorded. In the conditioning session, the mice were confined to one of two chambers and allowed to explore without optogenetic stimulation for 20 min. After at least 4 h, the mice were confined to the other chamber for 20 min, during which they were subjected to optogenetic inhibition of MRN serotonergic neurons (cycles of 20-s light ON and 10-s light OFF) and received the same treatment on the following day. The posttest session was performed the day following the second conditioning session. In the posttest session, the mice were allowed to freely explore the two chambers again, and the time spent in each chamber was recorded. The CPP score, an index of the rewarding property of manipulation, was defined as the difference in the time spent in the optogenetic manipulation-paired chamber during the posttest and pretest sessions. The CPP scores of the eArchT-expressing animals were significantly higher than those of the Venus control animals (Fig. 1m). The increase in time spent in the optogenetic inhibitionpaired chamber cannot be accounted for by locomotor suppression, as locomotor activity in the posttest session in the eArchT-expressing mice was slightly but significantly higher than that in Venus control mice ( Supplementary Fig. 1f). Consistent with this finding, the time spent in the optogenetic inhibition-paired chamber significantly increased after conditioning sessions in the eArchT mice but not in the Venus mice (Fig. 1n). Optogenetic inhibition of 20 s per cycle in the CPP test was far longer than the duration of the decreased activity observed in our fiber photometry recordings ( Fig. 1g-i). Therefore, we further investigated whether brief inhibition of MRN serotonergic neurons was sufficient for inducing reward-related behavioral changes in an operant conditioning task. In this task, optogenetic inhibition was briefly applied for 3 s when the mice performed nose poking (Fig. 1o). The number of nose pokes during the 30-min session increased in a time-dependent manner in the eArchT-expressing mice (Fig. 1p). These results indicated that inhibition of MRN serotonergic neurons reinforced reward-seeking behavior.

Aversive stimuli activate MRN serotonergic neurons, and activation of MRN serotonergic neurons is aversive
To investigate the effects of an aversive stimulus on the neural activity of MRN serotonergic neurons, we measured GCaMP fluorescence in the MRN before and after a tail pinch (Fig. 2a, b and Supplementary  Fig. 2h). The fluorescence in the MRN increased after a tail pinch in all mice (Fig. 2c), whereas it did not significantly change after control treatment (the mouse was approached with the clamp, but the tail was not pinched) ( Supplementary Fig. 2d-f). The average signal peak (ΔF/F 0 ) for the GCaMP mice and Venus control mice was 3.36 ± 0.68% and 0.04 ± 0.54%, respectively (mean ± s.e.m., n = 7 mice; t 12 = 3.851, **P = 0.0023; Supplementary Fig. 2a-c). As tail pinch increased locomotion, the tail pinch was time-locked to the initiation of locomotion. To determine the extent to which changes in GCaMP fluorescence were affected by the initiation of locomotion, we analyzed GCaMP fluorescence in the MRN before and after the initiation of locomotion and compared the size of the changes to that of tail pinch-induced changes. We found a small but significant GCaMP fluorescence increase after the initiation of locomotion ( Supplementary Fig. 3a), whereas the size of the signal changes (mean ΔF/F 0 ) after the initiation of locomotion was much smaller than that after tail pinch (t 6 = 4.127, **P = 0.0062; Supplementary Fig. 3b). In addition, we measured GCaMP fluorescence before and after licking with water and quinine solution, another representative aversive stimulus. As the tested mice were water-deprived, water itself was predicted to have a positive value. We observed a significant decrease in GCaMP fluorescence in response to water, whereas quinine (5 mM) blunted this decrease (water: t 13 = 2.352, *P = 0.0351; quinine: t 13 = 0.2024, P = 0.8428; Supplementary Fig. 3c-f). We then measured GCaMP fluorescence in MRN serotonin neurons in response to auditory cues associated with foot shocks, using a cue-induced fear conditioning paradigm 38 . One day after training, GCaMP fluorescence in the MRN was measured before and after the onset of the auditory cues associated with foot shocks. We found that GCaMP fluorescence increased after the onset of the auditory cue (P < 0.05, n = 5 mice; Supplementary Fig. 3g).
To examine the causal relationship between this increase in activity and aversion-related behaviors, we expressed CheRiff, an excitatory optogenetic actuator 39   h Top: heatmap of signals (red-blue, high-low). One licking bout per row. Bottom: averaged GCaMP signals of one mouse. n = 10 bouts. i Mean GCaMP signals for seven mice. Lines and shaded areas indicate mean and s.e.m., respectively. Red and blue segments indicate a statistically significant increase and decrease from the baseline (P < 0.05; permutation test). j GCaMP fluorescence response to 10% and 0.5% sucrose solutions. Two-tailed paired t-test t 4 = 5.365, **P = 0.0058, n = 5 mice. l CPP test. m CPP scores calculated as spent time in a light stimulation-associated compartment in the posttest subtracted by that in the pretest in the Venus and eArchT mice (two-tailed unpaired t-test, t 19 = 2.804, **P = 0.0113, n = 8 (Venus) and 13 (eArchT) mice). n Spent time in the compartment associated with light stimulation was significantly increased after conditioning sessions in the eArchT mice but not in the Venus control mice (eArchT: two-tailed paired t-test (eArchT Pre vs eArchT Post), t 12 = 4.810, ***P = 0.0004, n = 13 mice; Venus: two-tailed paired t-test (Venus Pre vs Venus Post), t 7 = 0.3553, P = 0.7328, n = 8 mice). o Schematic of the nose poke self-stimulation test. Greenlight (3 s duration, 1 pulse/poke, 5 mW) was delivered when mice performed nose poke responses. p The number of nose poke in the Venus and eArchT mice (two-way repeated-measures ANOVA, virus (Venus or eArchT) × time (day 1, 2, 3, or 4) interaction F 3,60 = 1.925, P = 0.1351, the effect of virus F 1,20 = 38.40, ***P < 0.001, the effect of time F 3,60 = 0.7157, P = 0.5464, n = 12 (Venus) and 10 (eArchT) mice). Data are presented as mean ± s.e.m. Error bars indicate s.e.m. Source data are provided as a Source Data file.
implanted a small optical fiber targeting the MRN for optogenetic stimulation ( Fig. 2d and Supplementary Fig. 2i). Ex vivo electrophysiology experiment revealed that blue light illumination induced firing of CheRiff-expressing cells, which was time-locked to the illumination (80.4 ± 5.5% fidelity (32.2 ± 2.2 action potentials/40 pulses, n = 5 cells from 2 mice, Supplementary Fig. 3h, i). The number of serotonergic neurons expressing c-Fos, a marker of neuronal activity, in CheRiff-expressing animals after 20-min optogenetic stimulation was significantly larger than that in Venus-control animals (Supplementary Fig. 4a-d), confirming CheRiff expression in MRN serotonergic neurons and successful activation of MRN serotonergic neurons with blue light. To assess the contribution of these neurons to aversion-related behavior, we performed a conditioned place aversion (CPA) test (Fig. 2e). The CPA score, used as a proxy of aversion, was defined as the difference in time spent in the optogenetic stimulation-paired chamber (20 Hz) in the posttest session and in the pretest session. The CPA score in the CheRiff-expressing animals was significantly lower than that in the Venus mice ( Fig. 2f and Supplementary Fig. 2g). Consistent with this finding, the time spent in the optogenetic stimulation-paired chamber in the posttest session was significantly shorter than that in the pretest session in the CheRiff mice but not in the Venus mice (Fig. 2g), indicating that activation of MRN serotonergic neurons in a neutral context resulted in subsequent avoidance of this context. We then investigated the effect of brief activation of MRN serotonergic neurons on the processing of appetitive stimuli (Figs. 3 and 4). In the two-bottle choice test, the mice were presented with two drinking bottles filled with a 10% sucrose solution. Consumption of the reward from only one of the two bottles triggered optogenetic stimulation ( Fig. 3a-d, Supplementary Fig. 5a). Sucrose solution intake from the bottle associated with optogenetic stimulation (20 Hz) was significantly lower than that without stimulation in the CheRiff-expressing mice but not in the Venus-control mice (Fig. 3c), whereas the total intake did not differ between the two groups (Fig. 3d). These results indicated that the activation of MRN serotonergic neurons was aversive.
Although very simple behavior was required in the above tests, performance in these tests was still affected by learning or planning processes. To address this issue, we examined the effect of optogenetic stimulation of MRN serotonergic neurons on anticipatory responses, which develop as a result of learning or planning, and consummatory responses to an automatically delivered reward in a head-fixed setup ( Fig. 3e-i and Supplementary Fig. 5b-d). In the pretest session (0-10 min; Pre) and posttest session (20-30 min; Post), mice received 2 μL of 10% sucrose solution every 10 seconds without optogenetic stimulation, and the number of licks per 1 s was recorded. In the stimulation session (10-20 min; Stim), sucrose solution was delivered as in the pretest and posttest sessions, but blue light was applied to the MRN for 2 s from the onset of the sucrose solution delivery. The application of blue light (20 Hz) significantly reduced the frequency of licking behavior during the Stim and Post phases compared to the Pre phase only in the CheRiff mice. The reduction in consummatory licking appeared even in the first trial, suggesting that the activation of MRN serotonergic neurons could suppress the consumption of reward in a learning and planning process-independent manner. No significant differences in the frequency of licking behavior across phases were observed in the Venus mice. Another potential concern was that the nonselective expression of CheRiff might have affected the observed results because approximately 10% of CheRiff-expressing neurons were speculated to be nonserotonergic (Fig. 1d). To address this concern, we performed a similar experiment in Tph2-tTA::tetO-ChR2(C128S)-eYFP bigenic mice 40 where virtually all C128S-expressing cells are serotonergic (Fig. 4a-c, Supplementary Fig. 5e) 41 . We found that blue light application over the MRN significantly reduced the frequency of licking behavior during the Stim phase compared to the Pre phase at −4 s, −2 s, and 0 s. We also found a significantly higher frequency of licking behavior during the Post phase than during the Stim phase at 0 s ( Fig. 4a-c). The reduction of licking behavior looks more prominent in the bigenic mice than in CheRiff mice. As we injected AAV into the rostral part of the MRN in CheRiff mice, it is likely that serotonergic neurons in the caudal part of the MRN were more faithfully manipulated in bigenic mice. . a Schematic representation of AAV injection and fiber implantation. b Schematic of the fiber photometry setup and tail pinch task in free-moving mice. c Top: heatmap of GCaMP signals (red-blue, high-low) for each trial (one trial per mouse). Each row represents the data of one mouse. n = 7 mice. Bottom: averaged traces of GCaMP signals. Lines and shaded areas indicate mean and s.e.m., respectively. The red segment indicates a statistically significant increase from the baseline (P < 0.05; permutation test). d Schematic representation of AAV injection and fiber implantation. e Schematic of the CPA test. On day 1, mice underwent the two-compartment test without light stimulation (pretest: 15 min). On day 2 and 3, mice underwent conditioning sessions (20 min × two times per day) for two consecutive days. On day 4, mice again underwent the two-compartment test without light stimulation (posttest: 15 min). f Stimulation of MRN serotonergic neurons promoted CPA. The CPA scores calculated as spent time in a light stimulation-associated compartment in the posttest subtracted by that in the pretest in the Venus and CheRiff mice (two-tailed unpaired t-test (Venus vs CheRiff), t 15 = 3.401, **P = 0.0040, n = 8 (Venus) and 9 (CheRiff) mice). g Spent time in the compartment associated with light stimulation was significantly decreased after conditioning sessions in the CheRiff mice but not in the Venus control mice (CheRiff: two-tailed paired t-test (CheRiff Pre vs CheRiff Post), t 8 = 4.998, **P = 0.0011, n = 9 mice; Venus: two-tailed paired t-test (Venus Pre vs Venus Post), t 7 = 0.4957, P = 0.6353, n = 8 mice). Data are presented as mean ± s.e.m. Error bars indicate s.e.m. Source data are provided as a Source Data file.
To more directly assess hedonic responses that do not involve goal-directed behavior, we intraorally infused a drop of sucrose solution and observed taste reactivity 42 and facial expression 43 in Tph2-tTA::tetO-ChR2(C128S)-eYFP bigenic mice ( Fig. 4d-h). In this task, the mice did not receive any training, and no predictive cue was paired with the unconditioned stimuli. This design enabled us to purely assess the effects of optogenetic manipulation on hedonic responses. We found that blue light application to the MRN significantly reduced the number of tongue reactions and sweet sucrose-driven facial expressions during the Stim phase compared to the Pre phase (Fig. 4f, h). Tongue reactions were recovered during the Post phase, consistent with Figs. 3f-i, 4a-c, suggesting that the sucrose solution was still appetitive after MRN manipulation. Interestingly, however, the facial expression did not return to the baseline expression in the Post phase, suggesting that the previous MRN manipulation partially reduced the value of the sucrose solution. Collectively, these results indicated that increased activity in MRN serotonergic neurons mediated aversion.

MRN serotonergic neurons densely innervate the interpeduncular nucleus, ventral hippocampus, and medial habenula
To elucidate the serotonergic pathway regulating the processing of reward and aversive stimuli, we focused on the MRN-interpeduncular nucleus (IPN) pathway because the IPN has been shown to play a key role in regulating nicotine preference 44,45 , and anterograde tracing of MRN serotonergic neurons has shown that the IPN is densely innervated by these neurons 46 . To confirm the presence of MRN serotonergic projections transduced by AAV, immunohistochemical labeling of eGFP and serotonin transporter (SERT), a marker for serotonergic axons, was performed in the CheRiff-eGFP-expressing mice. We found fiber-like structures positive for eGFP and SERT in the IPN (Fig. 5a, b). Moreover, fluorescent latex microspheres (Retrobeads) injected into the IPN (Fig. 5e, f) were observed in MRN serotonergic neurons 4 weeks after Retrobeads injection. In addition to the IPN, we found fiber-like structures positive for     eGFP in the hippocampus and medial habenula (MHb). Previous reports have shown that MHb plays a critical role in the processing of reward and aversion [47][48][49] . Previous studies have indicated that the ventral hippocampus (vHP), but not the dorsal hippocampus (dHP), is necessary for regulating approach/avoidance behaviors and processing an aversive predatory odor 50,51 . We further performed immunohistochemical labeling of eGFP and SERT in the vHP and MHb of the CheRiff-eGFP-expressing mice. Similar to the IPN, we found fiber-like structures positive for eGFP and SERT in both the vHP (Fig. 5c) and MHb (Fig. 5d). Moreover, fluorescent Retrobeads injected into the vHP (Fig. 5g, h) and MHb  Supplementary Fig. 6i-k). However, when we analyzed c-Fos-positive pyramidal and non-pyramidal neurons in the vHP separately, we found significantly more c-Fos-positive non-pyramidal neurons in CheRiff mice than in Venus mice, whereas there was no significant difference between groups in c-Fos-positive pyramidal neurons (vHP (non-pyramidal): n = 6 (Venus) and n = 9 (CheRiff), t 13 = 3.806, **P = 0.0022, Supplementary Fig. 6g; vHP (pyramidal): n = 6 (Venus) and n = 9 (CheRiff), t 13 = 0.4284, P = 0.6754, Supplementary Fig. 6h). These results indicate that optogenetic activation of MRN serotonergic neurons increased neuronal activity in the IPN and vHP.

Appetitive stimuli inhibit MRN serotonergic neurons projecting to the IPN and vHP, and their inhibition is rewarding
To measure the changes in the activity of axon terminals of MRN serotonergic neurons, axon-GCaMP6s, a variant of GCaMP6s with enhanced transport to axon terminals 52 , was expressed in MRN  MHb, respectively. We found that axon-GCaMP fluorescence in the IPN significantly decreased during sucrose consumption ( Fig. 6b-d), similar to the results in MRN cell bodies (Fig. 1i). The axon-GCaMP fluorescence in the vHP also decreased during sucrose consumption but to a smaller extent than that in the IPN or MRN (Fig. 6i-k). The extent of fluorescence changes in the IPN and vHP in the axon-GCaMP6s-expressing animals was significantly larger than that in the Venus control animals (IPN: n = 7 mice per group, t 7.895 = 4.302, **P = 0.0027; vHP: n = 6 mice per group, t 10 = 4.228, **P = 0.0018; Supplementary Fig. 7b-e, h-k). In contrast, the size of the fluorescence changes in the MHb in the axon-GCaMP6s-expressing mice was similar to that in the Venus mice (n = 6 mice per group, t 5.561 = 1.458, P = 0.1990; Supplementary Fig. 7n-q), although we found a slight but significant decrease in fluorescence 2-3 s after lick onset from baseline in the axon-GCaMP6s-expressing mice ( Fig. 6p-r). Moreover, we measured axon-GCaMP fluorescence in the lateral habenula (LHb) and the paraventricular nucleus of the thalamus (PVT), where clear projections of MRN serotonergic neurons were observed (Fig. 5d). However, we did not find significant fluorescence changes in either nucleus in response to sucrose consumption compared with the Venus control (Supplementary Fig. 7s-v). To assess whether these changes in activity in serotonergic projections from the MRN were causally involved in the regulation of reward-related behavior, we examined the effect of inhibition of these projections in the CPP test. We injected AAV-mTph2-eArchT or AAV-mTph2-Venus into the MRN and implanted an optical fiber into the IPN, vHP or MHb (Fig. 6e, l, s, and Supplementary  Fig. 8e-h). Optogenetic inhibition in the IPN and vHP during conditioning sessions significantly increased CPP scores (Fig. 6f, m and Supplementary Fig. 7f, l) and the time spent in the chamber associated with optogenetic inhibition (Fig. 6g, n). In contrast, optogenetic inhibition in the MHb did not significantly affect CPP scores or the time spent in the chamber associated with optogenetic inhibition (Fig. 6t, u and Supplementary Fig. 7r). These data indicated a critical and moderate role of the 5-HT MRN→IPN and 5-HT MRN→vHP pathways, respectively, in appropriate processing of reward information.
Aversive stimuli activate MRN serotonergic neurons projecting to the IPN and vHP, and their activation is aversive We then measured changes in the activity of axon terminals of serotonergic neurons following the presentation of aversive stimuli ( Fig. 7a, f, k, and Supplementary Figs. 9a, e, i and 10a-d). Similar to the responses to a reward ( Fig. 6), we found that axon-GCaMP fluorescence in the IPN and vHP, but not in the MHb, significantly increased after a tail pinch ( Fig. 7b, g, l and Supplementary Fig. 9b, f, j). However, fluorescence changes in the IPN appeared immediately after the aversive stimuli, while those in the vHP appeared 1 to 2 s later. Thus, we speculated that serotonergic terminals in the IPN are primarily involved in the processing of reward and aversive stimuli compared to those in the vHP. The size of fluorescence changes in the IPN and vHP, but not in the MHb, in the axon-GCaMP6s-expressing animals were significantly larger than those in the Venus control mice (IPN: n = 6 (Venus) and n = 7 (axon-GCaMP) mice, t 11 = 4.124, **P = 0.0017; vHP: n = 6 mice per group, t 10 = 2.437, *P = 0.0350; MHb: n = 6 mice per group, t 10 = 1.962, P = 0.0782; Supplementary Fig. 9c, g, k). To assess the causal relationship between the increase in activity and the expression of aversion-related behaviors, we expressed CheRiff or Venus in the MRN and implanted an optical fiber in the IPN, vHP or MHb (Fig. 7c, h, m, and Supplementary Fig. 10e-h). In the CPA test, optogenetic stimulation in the IPN and vHP significantly decreased the CPA score (Fig. 7d, i, and Supplementary Fig. 9d, h) and the time spent in the chamber associated with optogenetic stimulation (Fig. 7e, j), whereas that in the MHb did not (Fig. 7n, o, and Supplementary Fig. 9l). In Tph2-tTA::tetO-ChR2(C128S)-eYFP bigenic mice where virtually all ChR2-expressing neurons were serotonergic, we found significantly more c-Fos-positive cells in IPN of the bigenic mice than control mice after optogenetic stimulation in the MRN (n = 5 (control) and n = 4 (bigenic mice), t 7 = 4.21, **P = 0.004, Supplementary Fig. 9m). Additionally, optogenetic stimulation in the IPN of the bigenic mice significantly decreased the CPA score and the time spent in the chamber associated with optogenetic stimulation (n = 7 (control) and n = 7 (bigenic mice), (CPA score) t 12 = 2.686, *P = 0.020, (spent time-ChR2 bigenic) t 6 = 4.212, **P = 0.0056, Supplementary Fig. 9n, o). Considering the spatial closeness between IPN and MRN and the presence of serotonergic fibers passing through the IPN, optogenetic stimulation in the IPN may affect serotonergic projections from the MRN to other brain regions, including vHP, which leads to an apparently strong effect induced by IPN stimulation. Therefore, we performed optogenetic stimulation of 5-HT MRN→IPN Supplementary Fig. 11b). In vehicletreated mice, optogenetic stimulation in the MRN significantly decreased the CPA score, whereas pretreatment with WAY-100635 or MDL-100907 almost completely abolished this effect ( Fig. 8c and Supplementary Fig. 11a). Similarly, optogenetic stimulation significantly decreased the time spent in the chamber associated with stimulation in the mice pretreated with the vehicle but not in the mice pretreated with WAY-100635 or MDL-100907 (Fig. 8d). To identify the brain region responsible for these effects, we then locally administered the antagonists. We found that the neural activity of the 5-HT MRN→IPN pathway promptly and more prominently changed in responses to rewarding and aversive stimuli compared to that of the 5-HT MRN→vHP pathway. Therefore, we evaluated the effects of intra-IPN administration of the antagonists (Fig. 8e). Five minutes before the conditioning session in the CPA test, we administered WAY-100635 (1 μg in 0.2 μL), MDL-100907 (0.1 μg in 0.2 μL) or a vehicle (saline, 0.2 μL) through injection cannula implanted into the IPN ( Fig. 8f and Supplementary Fig. 11d). We found that intra-IPN administration of MDL-100907, but not of WAY-100635, reversed the reduction in the CPA score induced by the activation of the MRN (Fig. 8g and Supplementary Fig. 11c). Moreover, optogenetic stimulation significantly decreased the time spent in the chamber associated with stimulation in vehicle-or WAY-100635-treated mice but not in MDL-100907-treated mice (Fig. 8h). Immunohistochemical analysis using two different 5-HT 2A receptor antibodies revealed immunoreactivity with cell soma-like morphology in the IPN ( Supplementary Fig. 11g, h). We also found 5-HT 2A mRNA expression in the IPN using RT-PCR (Supplementary Fig. 11i). We examined the effects of intra-IPN administration of TCB-2, a 5-HT 2A agonist, on CPA scores without optogenetic manipulation. We found that intra-IPN administration of TCB-2 (0.05 μg/0.5 μL) significantly decreased the CPA score and the time spent in the chamber associated with TCB-2 (

Discussion
Serotonin, a monoaminergic neurotransmitter, has long been associated with a variety of brain functions, including the processing of reward/aversive information, mood regulation, and memory 56,57 . Our study identified and clarified the critical role of MRN serotonergic neurons and their projections to the IPN in the processing of reward and aversive information through cell type-specific activity recordings and optogenetic manipulation. Several previous studies have separately indicated that either the MRN or IPN was involved in the processing of stimuli with a negative valence [29][30][31][32]44,45 . We replicated these findings using more selective manipulations and more direct measurements, such as taste reactivity and facial expression, which do not involve goal-directed behavior, and further identified the role of serotonergic projections between these two brain regions, the MRN and IPN, in the processing of reward and aversive information. Although we selectively recorded and manipulated MRN serotonin neurons only, our results were similar to those of vGluT2-expressing MRN glutamatergic neurons 32 . Further studies are required to determine whether the roles of serotonin neurons in the MRN could be discriminated from the roles of vGluT2-expressing MRN glutamatergic neurons. Our results identified the 5-HT MRN→IPN pathway as a crucial circuit for the appropriate processing of reward and aversive information. We , 5-HT 1B 64 , and 5-HT 4 65 . Considering the limitation of pharmacological intervention with regard to receptor selectivity, further analysis using gene knockdown and Cas9-mediated in vivo gene knockout 66 is necessary for determining the 5-HT receptors critical for processing aversive stimuli.
Appetitive stimuli suppressed the activity of MRN serotonergic neurons (Fig. 1f-i), while they activated DRN serotonergic neurons 16,17 ( Supplementary Fig. 12a-h). We and others have shown that activation of DRN serotonergic neurons produces reward-related behaviors mainly through its terminals in the ventral tegmental area (VTA) 14,15 . These reports indicated that the activity of DRN serotonergic neurons plays a role in reward signaling. Anatomical studies have demonstrated that afferents from the IPN are found in the LHb 67 . Lammel et al. have shown that LHb inputs to the VTA preferentially activate mesoprefrontal dopaminergic neurons, whose activation is aversive 68 . Therefore, it is possible that activation of the 5-HT MRN→IPN pathway by aversive stimuli activates the LHb-VTA-medial prefrontal cortex pathway, which codes for a negative value. Moreover, a previous study showed that activation of LDTg-projecting IPN neurons induced inhibition of VTAprojecting LDTg neurons and place avoidance 45 . Interestingly, mesolimbic dopaminergic neurons are innervated by LDTg neurons, whereas activation of VTA-projecting LDTg neurons induced place preference 68 . Therefore, it is also possible that activation of the 5-HT MRN→IPN pathway by aversive stimuli inhibits the LDTg-VTA-nucleus accumbens (NAc) pathway, which codes for a positive value. On the other hand, vesicular glutamate transporter 3 (vGluT3)-expressing DRN neurons, which also contain serotonin, strongly innervate mesolimbic dopaminergic neurons 69 . Wang et al. revealed that activation of VTA-projecting DRN serotonergic neurons increases extracellular dopamine levels in the NAc and induces a place preference 70 . Taken together, activation of DRN serotonergic neurons by rewarding stimuli activates the VTA-NAc pathway, which codes positive value. Collectively, it is possible that signaling from the MRN and DRN converges in mesolimbic dopaminergic neurons in the VTA, while signaling from the MRN also regulates mesoprefrontal dopaminergic neurons in the VTA, although further investigation is necessary to determine to what extent the activation/inhibition of MRN serotonergic neurons affects the activity of mesolimbic and mesoprefrontal dopaminergic neurons. SSRIs, which upregulate serotonergic neurotransmission from both the MRN and DRN and thus enhance these discrete serotonergic systems coding for opposite values, may lead to less efficacy on anhedonia and MHb (t) (IPN: t 16 = 2.237, *P = 0.0399, n = 9 mice; vHP: t 16 = 2.260, *P = 0.0382, n = 9 mice; MHb: t 16 = 1.141, P = 0.2705, n = 9 mice). g, n, u, Spent time in the compartment associated with light stimulation in the IPN (g) and vHP (n) but not MHb (u) was significantly increased after conditioning sessions in the eArchT mice (IPN-eArchT: (Pre vs Post), t 8 = 3.486, **P = 0.0082, n = 9 mice; IPN-Venus: (Pre vs Post), t 8 = 0.8922, P = 0.3983, n = 9 mice; vHP-eArchT: (Pre vs Post), t 8 = 2.800, *P = 0.0232, n = 9 mice; vHP-Venus: (Pre vs Post), t 8 = 0.2371, P = 0.8186, n = 9 mice; MHb-eArchT: (Pre vs Post), t 8 = 0.2155, P = 0.8347, n = 9 mice; MHb-Venus: (Pre vs Post), t 8 = 1.233, P = 0.2527, n = 9 mice). Two-tailed unpaired t-test (f, m, t) or twotailed paired t-test (g, n, u) were used for statistical analysis. Data are presented as mean ± s.e.m. Error bars indicate s.e.m. Source data are provided as a Source Data file.  in major depression. From this perspective, differentially expressed genes between MRN and DRN serotonergic neurons 71,72 will be an ideal drug target to treat major depression through pathway-specific modulation of serotonergic neurotransmission. Through analysis of MRN GCaMP fluorescence in the same baseline setting as that of a previous report on DRN serotonergic neurons 16 , we found increased GCaMP fluorescence before sucrose licking. As the tested mice had experienced a number of sucrose licks before measurement to acclimate to the setup, further measurement using appetitive Pavlovian conditioning, as previously reported in the DRN 17 , is needed to determine whether increased activity in MRN serotonin neurons is associated with reward anticipation.
Previous reports have shown that MRN neurons play a critical role in the regulation of hippocampal theta oscillations, locomotion, and memory function 57,73-76 , as well as in the processing of appetitive and aversive stimuli. Although the extent of suppression was much smaller than that of sucrose, we found that immobility induced a significant decrease in GCaMP fluorescence in the MRN. Similarly, the initiation of locomotion induced a significant increase in GCaMP fluorescence in the MRN, although the size of the increase was substantially smaller than that of the tail pinch ( Supplementary Fig. 1m, n and 3a, b). Additionally, quinine, another aversive stimulus 16,17 , blunted waterinduced decreases in GCaMP fluorescence in MRN. Furthermore, auditory cues associated with foot shocks, which induce immobility in a new context 38 , increased GCaMP fluorescence in the MRN. Collectively, these results indicate that MRN serotonergic neurons respond to rewarding and aversive stimuli irrespective of locomotor changes. At the same time, our data indicate that locomotor activity affects the   activity of MRN serotonergic neurons, although the size of the changes was relatively small. This could stem from the number of neurons responding to each stimulus, as fiber photometry measures the fluorescence of a whole nucleus as a proxy for population activity. Therefore, further analysis using in vivo endoscopy is required to determine whether locomotor changes robustly affect the activity of a subpopulation of MRN serotonergic neurons. Additionally, we have to note that it is possible that the difference in the size of Ca 2+ response to the initiation of locomotion and tail-pinch may be due to the difference in locomotor speed because we did not align the Ca 2+ signals to a change of locomotor speed. Although a recent report has indicated that hippocampal theta oscillations are mainly modulated by nonserotonergic MRN neurons 77 , it is possible that the manipulation of MRN serotonergic neurons affects memory formation. However, we found that activation and inhibition induced strong CPA and CPP, both of which required intact memory function. These results indicate that mice have sufficient memory to associate the conditioned chamber with valence, even in the presence of manipulation of MRN serotonergic neurons. We also found that activation of MRN serotonin neurons reduced consummatory licking in the head-fixed setup and hedonic response in facial expression analysis, both of which are much less amenable to confounding memory function. Although these results strongly support our notion that inhibition and activation of MRN serotonergic neurons are rewarding and aversive, respectively, we cannot rule out the possibility that the memory-enhancing effect of manipulation of MRN serotonergic neurons overrode CPA and CPP by manipulating MRN serotonergic neurons. Further analysis of the role of MRN serotonin neurons in regulating memory formation and hippocampal theta oscillation will be crucial for a better and more comprehensive understanding of the role of MRN serotonergic neurons. We found that the activity of serotonergic terminals in the vHP was also altered in response to aversive or reward stimuli, but the alterations were delayed compared to those in the IPN (Figs. 6 and 7). Given that previous studies have shown that the 5-HT MRN→vHP pathway regulates aversive memory and anxious states 78,79 , this pathway might mediate indirect or secondary effects of the processing of aversive stimuli rather than the processing itself. It should also be noted that the 5-HT MRN→vHP pathway is activated during sustained goal-directed actions, although it is not directly related to the processing of reward and aversive stimuli 80 . Mice might perceive the sustained action as aversive or stressful, although this sustained activity would result in preferable outcomes later. Serotonin 5-HT 3 receptors in the ventral hippocampus are necessary for sustaining goal-directed actions 80 , while aversive memory and anxiety are regulated by 5-HT 7 and 5-HT 2C receptors in the vHP, respectively 79,81 .
Although we focused on the IPN, vHP, and MHb as MRNprojecting regions, we do not necessarily deny the roles of other MRN-projecting regions in the processing of reward and aversive stimuli. MRN serotonergic neurons project their fibers not only to the vHP but also to the dHP. Luchetti et al. identified two types of serotonergic fibers from the MRN to dHP through two-photon calcium imaging of serotonergic fibers in dHP 80 . They found that the two types respond to either reward or movement and showed that inhibition of the 5-HT MRN→dHP pathway decreased the time spent in the area associated with reward presentation. In contrast with this report, we demonstrated that 5-HT MRN→vHP,IPN pathways negatively and positively responded to reward and aversive stimuli, respectively. We speculate that such diversity of roles played by MRN serotonergic neurons could be attributed to different subpopulations in MRN serotonergic neurons, each innervating different brain areas and responding to a reward in opposite ways. Of note, recent single-cell RNA-seq analyses have clearly demonstrated that serotonergic neurons in the DRN and MRN can be subdivided into more than ten clusters 71,72 . Moreover, our AAV infected most of the serotonin neurons present in the MRN (Fig. 1b-d in the present study), while AAV in Luchetti et al. infected a small proportion of serotonin neurons in the MRN (Fig. 2B in Luchetti et al.) 82 . Therefore, it is possible that each of these subpopulations has a distinct projection pattern and differentially responds to a reward.
The MHb plays a central role in regulating reward processing, mood, and decision-making 47,83,84 . Although this nucleus is densely innervated by MRN serotonergic neurons (Fig. 5) 46 , no significant effect of the 5-HT MRN→MHb manipulation was observed in either the CPP or CPA test (Figs. 6 and 7). Previous reports have demonstrated that the activity of MHb neurons is increased by nicotine 85 , which leads to an increase in extracellular dopamine levels in the NAc via the activation of nicotinic receptors in the MHb 86 , indicating a critical role of MHb in the effect of addictive drugs. Consistent with this notion, intra-MHb administration of an antagonist for nicotinic receptors precipitated withdrawal symptoms in chronically nicotine-treated mice 87 . Interestingly, intra-MHb administration of nicotinic receptor antagonists attenuated morphine self-administration but did not affect responding for sucrose 48 . Therefore, it is possible that manipulating the 5-HT MRN→MHb pathway may affect reward-related behaviors after chronic treatment with addictive drugs.
Although our results strongly suggest a key role of 5-HT MRN→IPN , there is a limitation of our study, mainly due to the presence of serotonergic fibers passing through the IPN and the spatial closeness between the IPN and MRN. Considering the spatial closeness of the IPN and MRN, it is possible that GCaMP fluorescence in the MRN may affect photometry measurements in the IPN. However, the histochemical analysis revealed that optogenetic manipulation in the IPN with much more intense excitation light than photometry measurements did not affect the activity in the MRN (Supplementary Fig. 6n). Hence, it is less likely that axon-GCaMP fluorescence in the MRN affected axon-GCaMP fluorescence in the IPN. In contrast, we cannot rule out the possibility that the axon-GCaMP fluorescence of serotonergic fibers passing through the IPN affects that in the IPN. However, a previous report has shown that fluorescent changes in axon-GCaMP6 in presynaptic boutons are twice as large as those in the axon shaft, whereas there was no difference in the baseline fluorescence of axon-GCaMP6 in presynaptic boutons and axon shaft 52 . Hence, the larger the contribution of baseline fluorescence derived from the fiber passing through the IPN, the less the contribution of axon-GCaMP fluorescence of presynaptic boutons in the IPN (i.e., 5-HT MRN→IPN ) to fluorescence changes. If fluorescence changes in the IPN may largely originate from the axon shaft passing through the IPN, there should be other brain regions where much larger fluorescent changes are observed than in the IPN. However, we found that fluorescence changes (relative to baseline fluorescence (F 0 )) in the IPN were much larger than those in other brain regions, including vHP, MHb, LHb, and PVT (Figs. 6 and 7, Supplementary Fig. 7s-v), where dense MRN serotonergic projections were observed. Collectively, these results indicate a relatively large contribution of 5-HT MRN→IPN to the fluorescence changes in the IPN. Nevertheless, we cannot completely rule out the possibility that other MRN serotonergic projections respond to appetitive and aversive stimuli. For retrograde labeling, previous reports have indicated that Retrobeads enable retrograde labeling with minimal entry into undamaged fibers of passage and minimal diffusion at the injection site 88 , indicating that retrograde labeling from the IPN to the MRN serotonergic neurons, which we observed here, was mainly due to 5-HT MRN→IPN .
In conclusion, we identified MRN serotonergic neurons, including those projecting to the IPN, as important mediators in the processing of rewards and aversive stimuli. Moreover, our data suggested that 5-HT receptors, including 5-HT 2A receptors in the IPN, are involved in the aversive properties associated with MRN serotonergic neuron activity. Further identification of the mechanisms underlying 5-HT 2A signaling in the IPN will provide insights into a better understanding of the processing of rewards and aversive stimuli and the biological basis of mental disorders where aberrant processing of rewards and aversive information are observed, such as in drug addiction and major depressive disorder.

Animals
All mice were handled in accordance with the ethical guidelines of the Kyoto University animal research committee (approval code: 13-41-2, 19-41-1,2,3), the animal research committee of Hokkaido University (approval code: 18-0070), and the animal research committee of Kanazawa University (approval code: AP-204167). Adult male C57BL/6J mice (7-11 weeks old; Japan SLC) were housed in groups (no more than six mice in an individual cage) in a plastic cage with wooden bedding and free access to food (MF, Oriental Yeast) and water, and kept under constant ambient temperature (22 ± 2°C) and humidity (55 ± 10%), with 12 h light-dark cycles, unless otherwise stated. Mice were randomly assigned to each experimental group.
For the fixed-time schedule task, adult male C57BL/6N mice supplied from Japan SLC Co. Ltd. (aged >70 days) or adult male and female Tph2-tTA::tetO-ChR2(C128S)-eYFP bi-transgenic mice (8-20 weeks old) and their littermates were used. Approximately, 50% of the bitransgenic mice were female. tetO-ChR2(C128S)-eYFP knock-in mice (RRID:IMSR_RBRC05454) and Tph2-tTA BAC transgenic mice (RRI-D:IMSR_RBRC05846) have been described in previous reports 39,40 . We previously demonstrated that Tph2-tTA:tetO-ChR2(C128S)-eYFP bitransgenic mice selectively express ChR2 in almost all the serotonin neurons of the DRN and MRN, and that blue light application to the raphe nuclei increased the firing rates of serotonin neurons and yellow light application returned it to the baseline 41 . These mice were backcrossed to the C57BL/6N strain for more than ten generations. Animals were housed at 25 ± 2°C and relative humidity of 40-50% in rooms where the light was turned on from 19:00 to 07:00 h. The tests were performed during the dark period (07:00 to 19:00 h). Mice were housed in groups of 3-5 until they received surgery and single-housed following surgery until the end of the experiment. Food and water were provided ad libitum until the beginning of the experiment.

Production and purification of viral vectors
AAVs were prepared according to the previous report 15 . Lenti-X 293 T cells (Clontech) were grown to 60-70% confluency, and 8 μg of pHelper, 5 μg of pAAV-DJ Rep-Cap (Cell biolabs) and 5 μg of transfer plasmid were transfected with polyethylenimine (24765-1, Polysciences). After 60-72 h of incubation, the supernatant was aspirated, and 500 μL of 1× Gradient Buffer was added to the cells on each plate and then collected. The cell suspension was frozen in liquid nitrogen for 10 min and placed in a 55°C water bath until the cells were completely thawed. This freeze-thaw cycle was repeated three more times. After the addition of 0.5 μL of benzonase (E1014-5KU, Sigma-Aldrich), the lysate was incubated at 37°C for 60 min, centrifuged for 15 min at 3000 × g, and then the supernatant was collected. A discontinuous density gradient of 15%, 25%, 40%, and 58% iodixanol was prepared in an ultracentrifuge tube, and the supernatants were dripped onto the top layer of the density gradient. The tube was ultracentrifuged at 48,000 rpm (average 248,113 × g), 18°C for 1 h in a 50.4Ti rotor (Beckman-Coulter). After ultracentrifugation, a syringe with an 18gauge needle was inserted approximately 1-2 mm below the boundary between 40% and 58% gradient buffer layers, and about 0.5 mL of solution was slowly extracted. This was aliquoted and stored at −80°C until use. The titer was measured by qPCR and estimated to be approximately 1.0 × 10 13 vg/mL.
For fiber photometry experiments and optogenetic experiments, 0-2 weeks after the viral injection, the animals were implanted with optic fiber (0.50 numerical aperture (NA), 250 μm/240 μm (clad/core); CK-10, Mitsubishi Rayon) so that the tip of the fiber was placed just above the MRN (AP ). An optical fiber was placed in a ceramic ferrule and inserted toward the target region. The ferrule was supported with a stainless-steel screw and dental acrylic. The mice were individually housed for at least 7 days to recover before behavioral analyses.
For the fixed-time schedule task, mice were anesthetized with isoflurane (4% for induction and 1% for maintenance) and fixed in a stereotaxic frame (Narishige). AAV-Tph2-CheRiff-eGFP or AAV-Tph2-Venus (400 nL) was injected through an injection cannula (35 gauge; Nanofil, NF35BL-2; World Precision Instrument) into the MRN (AP −4.4 mm, ML 1.8 mm, DV −4.3 mm at 20°). The virus solution was infused over 4 min at constant flow (100 nL/min) by a microinjection pump (Ultra Micro Pump II and Micro4 controller, World Precision Instruments), and the injector was left in place for 5 min after injection to allow diffusion. A week later, an optic fiber with a mirror tip at 45°( MA45; Doric Lenses) was implanted to target the MRN (AP −4.4 mm, ML −1 mm, DV −3.8 mm) and a chamber frame (CF-10, Narishige) was secured. Tph2-tTA::tetO-ChR2(C128S)-eYFP bi-transgenic mice received only optic fiber implantation and chamber frame surgery. After surgery, ointments containing antibiotics and steroids (Dolmycin, Zeria Pharmaceutical Co., Ltd.; Kenalog, Bristol-Myers Squibb) were applied to the wounds. A piece of jelly containing carprofen (MediGel, ClearH2O) was placed in the home cage, and the mice were housed individually and allowed to recover for seven days prior to behavioral experiments.
For pharmacological experiments, 0-2 weeks after the viral injection, the animals were implanted with a guide cannula (CXG-5.1 (T), Eicom) so that the tip of the cannula was placed into the IPN (AP −3.8 mm, ML 0 mm, DV +4.9 mm from bregma). The cannula was supported with a stainless-steel screw and dental acrylic. A dummy cannula (CXD-5.1 (T), Eicom) was inserted into the guide cannula to preserve the dosing channel until drug administration. The mice were individually housed for at least 7 days to recover.

Histology and imaging
After behavioral analyses, all mice were anesthetized under sodium pentobarbital (Somnopentyl, Kyoritsu Seiyaku, 50 mg/kg) or a mixture of medetomidine (Domitor, Zenoaq, 0.3 mg/kg), midazolam (Dormicum, Maruishi Seiyaku, 4.0 mg/kg), and butorphanol (Betorfal, Meiji Seika Pharma, 5.0 mg/kg). Mice were transcardially perfused with phosphate-buffered saline (PBS) followed by 4% paraformaldehyde (02890-45, Nacalai Tesque) in phosphate buffer (PB). After perfusion fixation, the brains were removed, and equilibrated in 15% sucrose in PB for overnight. The brains were frozen and cryosectioned into 30 μm-thick coronal sections with the cryostat (Leica CM3050S; Leica Biosystems) and stored at −80°C until immunohistochemical processing. For immunohistochemistry, the sections were immersed in 0.25% Triton X-100 ( The primary and secondary antibodies were dissolved in PBS-T with 5% horse serum and with 2% horse serum, respectively. The sections were then washed in PBS and mounted on glass with Fluoromount/Plus (K048, Diagnostic BioSystems) or DAPI Fluoromount-G (0100-20, Southern Biotech). Retrobeads was not immunostained, and the raw fluorescence was observed. Fluorescence was visualized using laser scanning confocal microscopy (FV10i, Olympus). AAV infection was verified immunohistochemically. The data points obtained from the mice with failed AAV infection, fiber implantation, or cannula implantation were excluded from the analysis. The numbers of the animals used were empirically determined and were similar to the previous reports 16,52,68,90,91 .
For verification of AAV infection, fiber implantation, and cannula implantation, target brain regions (MRN, IPN, vHP, MHb, LHb, PVT, and DRN) were determined by anatomical landmarks with the guidance of the mouse brain atlas 89 . For the determination of AAV specificity and coverage, the number of Tph2-positive and AAV-positive cells in the MRN were counted in at least 3 sections within 150 μm anteroposterior to the slice containing injection site and were averaged, from −4.16 to −4.48 mm from the bregma. The cellular resolution images of coronal sections of interest were acquired through the 10× and 60× objective, using laser scanning confocal microscopy (FV10i, Olympus) and were processed with FV10i-SW software (Olympus) and ImageJ software (National Institutes of Health).
For verification of fiber placements in transgenic mice, mice were sacrificed under deep anesthesia (urethane, U2500, Sigma-Aldrich, 2 g/ kg, i.p.) after the completion of these experiments. Each brain was rapidly removed, cryoprotected with Tissue-Tek® O.C.T Compound (Sakura® Finetek USA, Inc.), and frozen at −80°C. Coronal sections (40μm thick) were cut on a cryostat and thaw-mounted onto slides. Optical fiber placements were verified under a confocal laser-scanning microscope FV1000 (Olympus) according to the atlas 89 . Data from mice with incorrect placements were excluded from the analysis. Immunohistochemistry was performed as described previously 92 with minor changes. In brief, mice were anesthetized with urethane (U2500, Sigma-Aldrich, 2 g/kg, i.p.). Following intracardial perfusion with 4% paraformaldehyde in PBS (pH 7.2), mice brains were post-fixed overnight, placed in 0.1 M PB containing 20% sucrose, and sectioned at a thickness of 40 μm. Sections were incubated successively with 5% normal donkey serum for 20 min and a mixture of primary antibodies overnight. The primary antibodies used were mouse anti-GFP (1:1000; 012-20461, WAKO) and rabbit anti-tryptophan hydroxylase 2 (1:10,000; Nakamura et al., 2008) 93 . Sections were washed three times in PBS-T before the next incubation using secondary antibodies. The secondary antibodies used were donkey anti-mouse Alexa 488 (A-21202, Invitrogen) and donkey anti-rabbit (Cy3; 711-165-152, Jackson ImmunoResearch). Images were captured with a confocal laserscanning microscope FV1000 (Olympus) for high magnification (10× and 40×) and a fluorescence microscope BX50 (Olympus) for low magnification (4×).

Fiber photometry
To record fluorescence signals, we used a custom-built fiber photometry system fabricated according to the previous reports 94,95 with minor changes. Blue light emitted from a 470-nm LED (M470L3 or M470L4, Thorlabs) was reflected by a dichroic mirror (MD498, Thorlabs) and focused by a 10× objective lens (MRH00101, Nikon) to the tip of the fiber patch cable (M123L01 and M73L01, Thorlabs) whose another tip was connected to the implanted optical fiber. The LED power was adjusted at the tip of the optical fiber to 0.02-0.03 mW to minimize bleaching. The fluorescence was bandpass filtered (FBH520-40, Thorlabs) and focused by a tube lens (58-520, Nikon) to a CMOS camera (CS2100M-USB, Thorlabs) regulated by the ImageJ software plugin Micro-Manager 96 . The fluorescence images were captured at 10 Hz.
For photometry data analysis, regions of interest were manually drawn around the fiber on the basis of a detected image, then the average fluorescence intensity was calculated for each image. We also draw regions of interest out of the fiber on the basis of a detected image and calculated the average intensity to account for extraneous, non-fluorescence-related light potentially contributing to the signal. We subtracted this acquired offset from the fluorescence intensity for each image. We derived the values of fluorescence change (ΔF/F 0 ) by calculating (F − F 0 )/F 0 , namely by subtracting the baseline fluorescence (F 0 ) from the fiber fluorescence at each time point (F) and dividing that value by the F 0 , where F 0 was set during the 1 s before the trigger events. For fiber photometry data analysis for DRN serotonin neurons, F 0 was the baseline fluorescence signal averaged over a 1.5-s-long control time window, which was typically set at 0.5 s preceding the trigger events 16 .

Optogenetic manipulation
The optic fibers implanted in mice were connected to the optical fiber patch cables. Light emitted from the diode-pumped solid-state (DPSS) laser (Beijing Viasho Technology) was converged to the fiber optic by the FC/PC collimator, which was connected to the optical fiber patch cable. The DPSS laser was driven by the microcontroller (Arduino Uno, Arduino SRL) to generate the pulse width modulation signal. For the Tph2-eArchT and Tph2-Venus mice, green light illumination was delivered throughout the conditioning session of the CPP test (Figs. 1 and 6; 532 nm, 5 mW at the tip of the fiber, continuous, 20-s ON, 10-s OFF, for soma stimulation and axon terminal stimulation). In the selfstimulation test, brief green light illumination was delivered just after nose-poking ( Fig. 1; 532 nm, 5 mW at the tip of the fiber, 3-s continuous, for soma stimulation). For the Tph2-CheRiff and Tph2-Venus mice, blue light illumination was delivered throughout the conditioning session of the CPA test (Figs. 2, 7, and 8; 473 nm, 5 mW at the tip of the fiber, 20 Hz frequency, 10-ms ON and 40-ms OFF (20% duty cycle), for soma stimulation and axon terminal stimulation) and during sucrose solution intake from a light stimulation-paired bottle ( Fig. 3 and Supplementary Fig. 5; 473 nm, 5 mW at the tip of the fiber, 20 Hz frequency, 10-ms ON, 40-ms OFF, for soma stimulation). In Supplementary Fig. 4 and 6, to evaluate c-Fos expression, blue light illumination was delivered 20 min in a chamber of the same size as the chamber of the CPA test (473 nm, 5 mW at the tip of the fiber, 20 Hz frequency, 10-ms ON, 40-ms OFF, for soma stimulation).
For MRN manipulation during the fixed-time schedule task in AAV-injected mice, blue light (473 nm, 10 mW at the tip of the implanted optic fiber) application to the MRN was generated by a laser light source (COME2-LB473/589/100; Lucir). The light delivery was controlled via TTL pulses driven by a microcontroller board (Arduino Uno, Arduino SRL). Mice that acquired an anticipatory number of licks/ seconds superior to 1.5 were tested the next day. Based on the stable and stereotypic pattern of licking behavior that developed in this task, the anticipatory licking phase was determined as starting from −6 s and ending at 0 s, the latter corresponding to the reward delivery. A test session consisted of 3 phases of 10 min, identical to the training runs. The first phase, Pre was used as a baseline of the licking behavior developed during the training phase. During the following phase Stim, a 2-s long blue light pulse (473 nm, 10 mW, 20 Hz, 10-ms ON, 40-ms OFF) was delivered from the onset of each reward delivery. Finally, phase Post was a replicate of phase Pre to assess a potential longlasting effect of the optogenetic manipulation.
For MRN manipulation during the fixed-time schedule task and CPA test in bi-transgenic mice, blue light (473 nm, 1-3 mW at the tip of the implanted optic fiber) or yellow light (575 nm, 1-4 mW at the tip of the implanted optic fiber) application to the MRN was generated by a LED light source (SPECTRA 2-LCR-XA light engine; Lumencor). A test session consisted of 3 phases of 10 min, identical to the training runs. The first phase, Pre was used as a baseline of the licking behavior developed during the training phase, during which a 0.5-s constant yellow pulse was delivered from the onset of each reward delivery and followed 1.5 s later by a second 0.5-s constant yellow pulse. During the following phase Stim, the same light procedure was used, but the first pulse was blue. Finally, phase Post was a replicate of phase Pre to assess a potential long-lasting effect of the optogenetic manipulation.
Behavioral analyses Sucrose and quinine solution licking test. The mice were waterdeprived for 13-18 h per day for three days before the test. Before the test was conducted, the mice were placed for 5-8 h per day in a test cage (dimensions: 20.8 × 13.6 × 11.5 cm, Length × Width × Height) and allowed to drink ad libitum 10% (w/v) sucrose solution for two consecutive days to acclimate to a test cage. The next day, the mice attached with a fiber cable were placed in a test cage equipped with a bottle filled with 10% sucrose solution. The test and the fiber photometry recording were performed for 10 min in a free-moving setup. The mice had free access to the solution during the test. The timing of the lick onset was curated through analyses of recorded movies. For comparison of fluorescence response induced by different concentrations of sucrose, the mice were water-deprived, as mentioned above, for two consecutive days. Then, the mice attached with a fiber cable were placed in a test cage and were presented with 10% and 0.5% sucrose solution alternately. For the recording of the response to water and quinine solution, the mice were water-deprived, as mentioned above, for two consecutive days. Then, the mice attached with a fiber cable were placed in a test cage and were presented with a water solution. After the water solution was licking, the bottle filled with 5 mM quinine solution (quinine hydrochloride, CAS: 130-89-2; Tocris) was replaced with a bottle of water. Fiber photometry recording was performed throughout the session.
CPP test. The CPP test was performed according to previous reports 97,98 with minor changes. The CPP apparatus consisted of two equal-sized compartments (dimensions: 15 × 24 × 30 cm, Length × Width × Height) with distinct tactile and visual cues. One compartment was white with a textured floor, and the other one was black with a smooth floor. In pretest sessions on Day 1, mice without the fiber patch cable were allowed to explore two compartments freely for 900 s, and the time spent in each compartment during the exploration period was measured using ANY-maze tracking software (ANY-maze version 6.0, Stoelting). Mice that spent more than 80% (>720 s) of the total time in one compartment in the pretest or that showed a difference of >200 s in time spent in one compartment between the pretest were excluded Article https://doi.org/10.1038/s41467-022-35346-7 from the following procedures (No mice were excluded by this criterion throughout the study). We used a bias-like protocol 36 and designated the compartment in which each mouse spent less time (<450 s) in the pretest as the light stimulation-paired compartment for that animal. In conditioning sessions on Days 2 and 3, mice were connected to a fiber optic patch cable. During the first half of the day's conditioning, one group of mice was confined to the stimulationpaired compartment for 20 min with light delivery, and the other group of mice was confined to the non-stimulation-paired compartment for 20 min without light delivery. During the second half of the conditioning, mice that received light delivery in the first half were confined to the non-stimulation-paired compartment for 20 min without light delivery. The other group of mice was confined in the stimulation-paired compartment for 20 min with light delivery. The mice were randomly grouped and conditioned. An interval of at least 4 h was left between the first and second half of the conditioning. In posttest sessions on Day 4, mice without the fiber patch cable were allowed to freely explore the two compartments for 900 s, and the time spent in each compartment during the exploration period was measured. The CPP scores were calculated by subtracting the time spent in the light-paired compartment during the pretest from that during the posttest.
Nose poke self-stimulation in an operant chamber. The operant conditioning chamber (dimensions: 15.24 × 13.34 × 12.7 cm, Length × Width × Height; Med Associates) was equipped with a nose poke port (ENV-303M; Med Associates) and encased in a sound-attenuating box. Nose poking through the hole resulted in the delivery of green light stimulation (532 nm, 1 pulse, 3 s) into the target region through the optical fiber. Light stimulation was delivered for each nose-poking response. The animals were plugged into the fiber-optic patch cord and placed into the chamber, and subsequently allowed to selfstimulate the target nucleus for 30 min. The mice were allowed to perform the task for four consecutive days, and the number of nose pokes was counted for each day.
Tail pinch test. The mice attached with a fiber-optic patch cord were placed in the test cage (dimensions: 20.8 × 13.6 × 11.5 cm, Length × Width × Height) for at least 5 min for habituation before the test. Then, the experimenter gave the mice a tail pinch as an aversive stimulus. For all mice, the same experimenter performed tail pinches using a bulldog serrefine clamp (18050-50, Fine Science Tools). The pinch continued not longer than 20 s to minimize the animal's pain, and the fluorescent response before and after the pinch was recorded. The timing of the pinch was curated through analyses of recorded movies.
Cue-induced fear conditioning. Auditory cue-induced fear conditioning was performed according to the previous report 38 . On day 1, mice without fiber patch cords were placed in the operant chamber (15.24 × 13.34 × 12.7 cm, Length × Width × Height) equipped with stainless steel grid rod floor and speaker for generation of auditory cue (2900 Hz; ENV-323AM, Med Associates) for 180 s. Then, an auditory cue (30 s duration, 2900 Hz) was applied for 30 s. Foot shock (0.6 mA, 2 s duration; ENV-414S, Med Associates) was applied after initiation of the auditory cue and terminated simultaneously with the auditory cue. This cycle was repeated twice with 30-s intervals (3 shocks in total). After conditioning, the mice were returned to their home cage. On day 2, mice with fiber patch cords were placed in the chamber with different floor textures (paper towels), different odors (vanilla oil), and different illumination colors for 180 s. Afterward, an auditory cue (2900 Hz) was applied, and GCaMP fluorescence in the MRN was measured.
CPA test. The same apparatus was used as the CPP test. After the same pretest (900 s, no mice were excluded by the abovementioned criterion throughout the study) as the CPP test, we designated the compartment in which each mouse spent more time (>450 s) in the pretest as the light stimulation-paired compartment for that animal. In conditioning sessions on Days 2 and 3, mice were connected to a fiber optic patch cable. During the first half of the day's conditioning, one group of mice was confined to the stimulation-paired compartment for 20 min with light delivery, and the other group of mice was confined to the non-stimulation-paired compartment for 20 min without light delivery. During the second half of the conditioning, mice that received light delivery in the first half were confined to the nonstimulation-paired compartment for 20 min without light delivery. The other group of mice was confined in the stimulation-paired compartment for 20 min with light delivery. The mice were randomly grouped and conditioned. An interval of at least 4 h was left between the first and second half of the conditioning. In posttest sessions on Day 4, as in the CPP test, mice without the fiber patch cable were allowed to freely explore the two compartments for 900 s, and the time spent in each compartment during the exploration period was measured. The CPA scores were calculated by subtracting the time spent in the light-paired compartment during the pretest from that during the posttest.
For the CPA test with drug administration, the mice were injected intraperitoneally with vehicle (1% DMSO in saline), WAY-100635 Two-bottle choice test. The mice were water-deprived for 13-18 h per day before the conditioning before each session. In the conditioning session, the mice were placed for 5-8 h per day in a test cage (dimensions: 20.8 × 13.6 × 11.5 cm, Length × Width × Height) and were allowed to take 10% sucrose solution in two identical bottles for 30 min × two times per session a day interleaved with at least 4 h interval. The amount of solution intake (g) in each bottle was recorded in each conditioning session. In the test session, the mice attached to a fiber optic patch cable were allowed to take 10% sucrose solution in two identical bottles. One bottle was associated with light stimulation (ON bottle), and another bottle was not (OFF bottle). Light stimulation was given whenever the mice licked the ON bottle. The position of ON and OFF bottles was counterbalanced between the first and second test sessions. The amount of solution intake (g) in each bottle was recorded per test day. We summed the solution intake in the first and second test sessions for each bottle to minimize possible differences due to bottle preference and left-right bottle position. Fixed-time schedule task. After recovery from surgery, mice were water-deprived in their home cage until the end of the experiment. Mice were trained in a square behavioral chamber that was closed by an opaque black curtain. Each animal was kept in a covered elevated platform (3D printed), with its head fixed by two stabilized clamps holding the side of the chamber frame 106 . The heights of the tunnel and clamps were adjusted before each session to ensure comfort. A steel blunt-type needle was placed in front of the mouse's mouth and in front of an infrared beam-break sensor that was used to detect licks. Mice were allowed to voluntarily lick the spout. A 2 μL drop from a 10% sucrose solution was delivered through the needle every 10 s by a peristaltic pump (lab-hacks.com; no more available, but available at https://www. tindie.com/products/mindgame/high-precision-peristaltic-pump-hat/), controlled by a microcontroller board (Arduino Uno, Arduino SRL) connected to a PC via a USB cable. A custom-written python script was used to collect the timing and number of licks and saved the data in a CSV-type file. Each daily session lasted for 30 min. Mice were allowed access to water in their home cage for 30 min following the end of a daily session.
Surgery for oral infusion of sucrose solution. Tph2-tTA::tetO-ChR2(C128S)-eYFP bigenic mice were anesthetized by intraperitoneal injection of a mixture of medetomidine, midazolam, and butorphanol 107 . We implanted the optic fiber to the MRN as described above and intraoral cheek fistula to the left cheek following a previous rat study 108 , but with a few changes for mice. We used soft polyurethane tubing (0.6 mm inner diameter and 1.0 mm outer diameter; Micro-Renathane, Braintree Scientific, Inc.) connected to a 22 gauge stainless steel needle. A customized Teflon washer (3 mm diameter) was glued to the other end of the tubing.
Oral infusion of sucrose solution and taste reactivity analysis. Water-deprived mice were fixed as described above. A 5 μL drop from a 20% sucrose solution was delivered through the intra-oral tube by a pump (OPR-7200, O'Hara & Co., Ltd.), controlled by a microcontroller board (Arduino Uno, Arduino SRL) connected to a PC via a USB cable. To check whether the apparatus works well and obtain prototypical face data (see Facial expression analysis), we infused the sucrose solution first. Following that, mice received three phases (Pre, Stim, and Post) at 10 min intervals. In the Stim phase, blue light was applied to the MRN for 500 ms at the timing of delivery of the sucrose solution.
In this task, we did not apply yellow light to intentionally stop optogenetic activation because the C128S mutant spontaneously deactivates within 1-2 min 109 which is far longer than 10 min intervals. We recorded the taste reactivity through the video camera (iVIS HF R52, Canon) at 30 fps. We counted the number of tongue protrusions from the mouth as taste reactivity and hedonic response to sucrose solution 42 . As a simpler measure of taste reactivity, we also counted the number of tongue appearances whenever the tongue was visible for 10 s after the delivery of the sucrose solution.
Facial expression analysis. We acquired facial video recordings at 30 fps using a USB 3.0 monochrome camera (BFS-U3-13Y3M-C, FLIR Integrated Imaging Solutions Japan Co., Ltd.), positioned on the right side of the mouse's head. Illumination was provided by a 940 nm IR LED array (K-Light, Cony Electronics Service Co., Ltd.). Before starting the experiments, the prototypical face was recorded before and after the oral infusion of sucrose solution. The acquired images were analyzed following a previously described procedure 43 . Briefly, each frame was converted into a histogram of oriented gradients (HOG) vector. The average similarity to the prototypical face of sucrose-induced facial expression was calculated using the HOG before and after intraoral sucrose infusion. We calculated the similarity index to compare the facial expression changes among three phases (Pre, Stim, and Post) as follows: Similarity index = after À before sucrose infusion after + before sucrose infusion

Quantification of c-Fos positive cells
The mice used for c-Fos quantification were not used in any other experiments. As described above, we conducted blue laser stimulation on the mice. Ninety min after the beginning of light stimulation, the mice were sacrificed, and the brains were harvested. Cryosections were prepared and processed as described in Histology and imaging section. The number of c-Fos positive cells was quantified in at least three different sections containing the MRN, IPN, vHP, or MHb, and averaged across sections.

Drug administration
For the CPA test with pharmacological intervention, the vehicle or drugs were administered intraperitoneally 30 min before the conditioning. For the CPA test with microinjection, the mice were implanted with a microinjection guide cannula (CXG-5.1 (T), Eicom) into the IPN. Five min before the conditioning, the vehicles or drugs were injected into the IPN (AP −3.8 mm, ML 0 mm, DV +4.9 mm from bregma) using a microsyringe (7002KH, Hamilton) with an injection cannula (CXMI-5.4 (T), Eicom) attached to a polyethylene tube. The solution (0.2 μL) was infused at a rate of 0.2 μL min −1 using a syringe pump (KDS 100, KD Scientific). The injection cannula was left in place for three additional min and then slowly withdrawn, and the dummy cannula was attached.

Statistics and reproducibility
For histochemical analyses, similar results were obtained in at least 10 ( Supplementary Fig. 5c), 6 ( Supplementary Fig. 5d), 5 ( Supplementary  Fig. 5e), and 3 ( Fig. 5) mice. For histochemical analyses of 5-HT 2A receptors (Supplementary Fig. 11g, h), experiments were performed once, but a similar staining pattern was commonly obtained by different antibodies both targeting 5-HT 2A receptors as shown in Supplementary  Fig. 11g, h. For RT-PCR analyses of Htr2a and Gapdh in IPN cDNA, experiments were performed once, but similar results were obtained in two biological replicates, as shown in Supplementary Fig. 11i.

Data analysis
The data are presented as mean ± s.e.m. Statistical significances were calculated using R (version 4.0.3), GraphPad Prism (versions 8 and 9, GraphPad Software, Dotmatics), and SPSS (version 23.0). Differences between two individual groups were compared using two-tailed unpaired Student's t-tests. Differences between two-time points of the same animal were compared using two-tailed paired Student's t-tests. For photometry, data were compared using 1000 permutations for an α-level of 0.05 to compare the values of ΔF/F 0 at each time point with baseline values 16 . For group comparisons, data were compared using one-way or two-way analyses of variance (ANOVA), followed by Tukey post hoc tests or two-way repeated-measures ANOVA followed by Sidak post hoc tests. Licking behaviors in the fixed-time schedule task were analyzed using a three-factor mixed-design ANOVA with the phase and time as the within-subject factors and the virus type as the between-subject factor or two-factor repeated-measures ANOVA with the phase and time as the within-subject factors in transgenic mice. In all cases, P values less than 0.05 (P < 0.05) were considered statistically significant. The variance within each group was analyzed by F-tests or Bartlett tests. The variance is similar between groups in all figures except Fig. 3d, Supplementary Figs. 4t, 7e, l, q, r, and 9h. Thus, Fig. 3d, Supplementary Figs 4t, 7e, l, q, r, and 9h were analyzed by two-tailed unpaired t-tests with Welch's correction.

Reporting summary
Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability
The source data are provided as a Source Data file. Other data that support the findings of this study are available from the corresponding authors on request. Source data are provided with this paper.